A large-scale prediction of protein-protein interactions based on random forest and matrix of sequence

نویسندگان

چکیده

Protein-protein interaction (PPIs) is an important part of many life activities in organisms, and the prediction protein-protein interactions closely related to protein function, disease occurrence, treatment. In order optimize performance interactions, here a RT-MOS model was constructed based on Random Forest (RF) Matrix Sequence (MOS) predict interactions. Firstly, MOS used encode sequences into 29-dimensional feature vector; Then, build random forest, optimized evaluated using test set; Finally, for prediction. The experimental results show that accuracy rates benchmark dataset non-redundant are 97.18% 91.34%, respectively, accuracies four external datasets C.elegans, Drosophila, E.coli H.sapiens 96.21%, 97.86%, 97.54% 97.75%, respectively. Compared with existing methods, it found superior methods. has advantages saving time, preventing overfitting high accuracy, suitable large-scale PPIs

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

: the effect of sericin levels (silk glue protein) on rate of in vitro maturation, fertilization and culture of sheep oocytes

هدف از آزمایش اول بررسی اثر سطوح مختلف سریسین [0 (control), 0.1, 0.5, 1.0, 2.5 %] افزوده شده به محیط , ivm بر cumulus cell expansion، بلوغ هسته و توسعه متوالی جنین، در گوسفندان نژاد سنجابی در فصل تولید مثلی می باشد. از سرگیری میوز به وسیله خارج شدن اولین پولار بادی اندازه گیری و هم چنین درصد رسیدن جنین های دو سلولی به مرحله کلیواژ و بلاستوسیت نیز به عنوان نشانه ای از میزان شایستگی توسعه اولیه ج...

Prediction of protein-protein interactions using random decision forest framework

MOTIVATION Protein interactions are of biological interest because they orchestrate a number of cellular processes such as metabolic pathways and immunological recognition. Domains are the building blocks of proteins; therefore, proteins are assumed to interact as a result of their interacting domains. Many domain-based models for protein interaction prediction have been developed, and prelimin...

متن کامل

Sequence-based prediction of RNA-protein interactions

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . xxiv CHAPTER 1. OVERVIEW . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1 1.1 Dissertation Organization . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3 1.2 Experimental Methods to Identify RNA-Protein Interactions . . . . . . . . . . 4 1.3 Computational Prediction of RNA-Protein Interfaces ....

متن کامل

Seeing the trees through the forest: sequence-based homo- and heteromeric protein-protein interaction sites prediction using random forest

Motivation Genome sequencing is producing an ever-increasing amount of associated protein sequences. Few of these sequences have experimentally validated annotations, however, and computational predictions are becoming increasingly successful in producing such annotations. One key challenge remains the prediction of the amino acids in a given protein sequence that are involved in protein-protei...

متن کامل

Detecting Protein-Protein Interactions with a Novel Matrix-Based Protein Sequence Representation and Support Vector Machines

Proteins and their interactions lie at the heart of most underlying biological processes. Consequently, correct detection of protein-protein interactions (PPIs) is of fundamental importance to understand the molecular mechanisms in biological systems. Although the convenience brought by high-throughput experiment in technological advances makes it possible to detect a large amount of PPIs, the ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: BIO web of conferences

سال: 2022

ISSN: ['2273-1709', '2117-4458']

DOI: https://doi.org/10.1051/bioconf/20225501017